Add live audio transcription streaming support to Foundry Local JS SDK by rui-ren · Pull Request #486 · microsoft/Foundry-Local

rui-ren · 2026-03-05T19:25:15Z

Here's the updated PR description with the renamed types:

Title: Add live audio transcription streaming support to Foundry Local JS SDK

Description:

Adds real-time audio streaming support to the Foundry Local JS SDK, enabling live microphone-to-text transcription via ONNX Runtime GenAI ASR.

The existing AudioClient only supports file-based transcription. This PR introduces LiveAudioTranscriptionClient that accepts continuous PCM audio chunks (e.g., from a microphone) and returns partial/final transcription results as an async iterable.

What's included

New files

src/openai/liveAudioTranscriptionClient.ts — Streaming client with start(), pushAudioData(), getTranscriptionStream(), stop(), dispose()
src/openai/liveAudioTranscriptionTypes.ts — LiveAudioTranscriptionResult and CoreErrorResponse interfaces, tryParseCoreError() helper

Modified files

src/imodel.ts — Added createLiveTranscriptionClient() to interface
src/model.ts — Delegates to selectedVariant.createLiveTranscriptionClient()
src/modelVariant.ts — Implementation (creates new LiveAudioTranscriptionClient(modelId, coreInterop))
src/index.ts — Exports LiveAudioTranscriptionClient, LiveAudioTranscriptionSettings, LiveAudioTranscriptionResult, CoreErrorResponse

API surface

const audioClient = model.createAudioClient();
const session = model.createLiveTranscriptionClient();

session.settings.sampleRate = 16000;
session.settings.channels = 1;
session.settings.language = "en";

await session.start();

// Push audio from microphone callback
await session.pushAudioData(pcmBytes);

// Read results as async iterable
for await (const result of session.getTranscriptionStream()) {
    console.log(result.text);
}

await session.stop();

Design highlights

Internal async push queue — Bounded AsyncQueue<T> serializes audio pushes from any context (safe for mic callbacks) and provides backpressure. Mirrors C#'s Channel<T> pattern.
Retry policy — Transient native errors retried with exponential backoff (3 attempts); permanent errors terminate the session
Settings freeze — Audio format settings are snapshot-copied and Object.freeze()d at start(), immutable during the session
Buffer copy — pushAudioData() copies the input Uint8Array before queueing, safe when caller reuses buffers
Drain-on-stop — stop() completes the push queue, waits for the push loop to drain, then calls native stop
Dispose safety — dispose() wraps stop() in try/catch, never throws

Native core dependency

This PR adds the JS SDK surface. The 3 native commands (audio_stream_start, audio_stream_push, audio_stream_stop) are routed through the existing execute_command / execute_command_with_binary exports. The code compiles with zero TypeScript errors without the native library.

Testing

✅ TypeScript compilation — 0 errors across all source files
⏳ Integration tests pending native core delivery

Parity with C# SDK

This implementation mirrors the C# LiveAudioTranscriptionSession (branch ruiren/audio-streaming-support-sdk) with identical logic:

Same session lifecycle: start → push → getStream → stop
Same push loop with retry and permanent error handling
Same settings freeze and buffer copy semantics
Same drain-before-stop ordering
Same renamed types: LiveAudioTranscription* (matching C# rename)

vercel · 2026-03-05T19:25:21Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
foundry-local	Ready	Preview, Comment	Mar 13, 2026 9:41pm

support audio streaming-js

9a1578c

update js package

4cf6cb4

vercel bot deployed to Preview March 13, 2026 21:41 View deployment

rui-ren requested review from kunal-vaishnavi March 13, 2026 21:44

rui-ren changed the title ~~Add real-time audio streaming support (Microphone ASR) - JS~~ Add live audio transcription streaming support to Foundry Local JS SDK Mar 13, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add live audio transcription streaming support to Foundry Local JS SDK#486

Add live audio transcription streaming support to Foundry Local JS SDK#486
rui-ren wants to merge 2 commits intomainfrom
ruiren/audio-streaming-support-sdk-js

rui-ren commented Mar 5, 2026 •

edited

Loading

Uh oh!

vercel bot commented Mar 5, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rui-ren commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What's included

New files

Modified files

API surface

Design highlights

Native core dependency

Testing

Parity with C# SDK

Uh oh!

vercel bot commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

rui-ren commented Mar 5, 2026 •

edited

Loading

vercel bot commented Mar 5, 2026 •

edited

Loading